Picture for Satoshi Sekine

Satoshi Sekine

Human-Grounded Multimodal Benchmark with 900K-Scale Aggregated Student Response Distributions from Japan's National Assessment of Academic Ability

Add code
May 12, 2026
Viaarxiv icon

Improving Methodologies for LLM Evaluations Across Global Languages

Add code
Jan 22, 2026
Viaarxiv icon

LLM-jp: A Cross-organizational Project for the Research and Development of Fully Open Japanese LLMs

Add code
Jul 04, 2024
Figure 1 for LLM-jp: A Cross-organizational Project for the Research and Development of Fully Open Japanese LLMs
Figure 2 for LLM-jp: A Cross-organizational Project for the Research and Development of Fully Open Japanese LLMs
Figure 3 for LLM-jp: A Cross-organizational Project for the Research and Development of Fully Open Japanese LLMs
Figure 4 for LLM-jp: A Cross-organizational Project for the Research and Development of Fully Open Japanese LLMs
Viaarxiv icon

Should We Respect LLMs? A Cross-Lingual Study on the Influence of Prompt Politeness on LLM Performance

Add code
Feb 22, 2024
Figure 1 for Should We Respect LLMs? A Cross-Lingual Study on the Influence of Prompt Politeness on LLM Performance
Figure 2 for Should We Respect LLMs? A Cross-Lingual Study on the Influence of Prompt Politeness on LLM Performance
Figure 3 for Should We Respect LLMs? A Cross-Lingual Study on the Influence of Prompt Politeness on LLM Performance
Figure 4 for Should We Respect LLMs? A Cross-Lingual Study on the Influence of Prompt Politeness on LLM Performance
Viaarxiv icon

WikiSQE: A Large-Scale Dataset for Sentence Quality Estimation in Wikipedia

Add code
May 10, 2023
Figure 1 for WikiSQE: A Large-Scale Dataset for Sentence Quality Estimation in Wikipedia
Figure 2 for WikiSQE: A Large-Scale Dataset for Sentence Quality Estimation in Wikipedia
Figure 3 for WikiSQE: A Large-Scale Dataset for Sentence Quality Estimation in Wikipedia
Figure 4 for WikiSQE: A Large-Scale Dataset for Sentence Quality Estimation in Wikipedia
Viaarxiv icon

Classifying Wikipedia in a fine-grained hierarchy: what graphs can contribute

Add code
Jan 22, 2020
Figure 1 for Classifying Wikipedia in a fine-grained hierarchy: what graphs can contribute
Figure 2 for Classifying Wikipedia in a fine-grained hierarchy: what graphs can contribute
Figure 3 for Classifying Wikipedia in a fine-grained hierarchy: what graphs can contribute
Figure 4 for Classifying Wikipedia in a fine-grained hierarchy: what graphs can contribute
Viaarxiv icon

Multi-class Multilingual Classification of Wikipedia Articles Using Extended Named Entity Tag Set

Add code
Sep 14, 2019
Figure 1 for Multi-class Multilingual Classification of Wikipedia Articles Using Extended Named Entity Tag Set
Figure 2 for Multi-class Multilingual Classification of Wikipedia Articles Using Extended Named Entity Tag Set
Figure 3 for Multi-class Multilingual Classification of Wikipedia Articles Using Extended Named Entity Tag Set
Viaarxiv icon

Select and Attend: Towards Controllable Content Selection in Text Generation

Add code
Sep 10, 2019
Figure 1 for Select and Attend: Towards Controllable Content Selection in Text Generation
Figure 2 for Select and Attend: Towards Controllable Content Selection in Text Generation
Figure 3 for Select and Attend: Towards Controllable Content Selection in Text Generation
Figure 4 for Select and Attend: Towards Controllable Content Selection in Text Generation
Viaarxiv icon

Can neural networks understand monotonicity reasoning?

Add code
Jun 27, 2019
Figure 1 for Can neural networks understand monotonicity reasoning?
Figure 2 for Can neural networks understand monotonicity reasoning?
Figure 3 for Can neural networks understand monotonicity reasoning?
Figure 4 for Can neural networks understand monotonicity reasoning?
Viaarxiv icon

HELP: A Dataset for Identifying Shortcomings of Neural Models in Monotonicity Reasoning

Add code
Apr 27, 2019
Figure 1 for HELP: A Dataset for Identifying Shortcomings of Neural Models in Monotonicity Reasoning
Figure 2 for HELP: A Dataset for Identifying Shortcomings of Neural Models in Monotonicity Reasoning
Figure 3 for HELP: A Dataset for Identifying Shortcomings of Neural Models in Monotonicity Reasoning
Figure 4 for HELP: A Dataset for Identifying Shortcomings of Neural Models in Monotonicity Reasoning
Viaarxiv icon